Google Books
   HOME

TheInfoList



OR:

Google Books (previously known as Google Book Search, Google Print, and by its code-name Project Ocean) is a service from
Google Inc. Google LLC () is an American multinational technology company focusing on search engine technology, online advertising, cloud computing, computer software, quantum computing, e-commerce, artificial intelligence, and consumer electronics. I ...
that searches the full text of books and magazines that Google has scanned, converted to text using
optical character recognition Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scen ...
(OCR), and stored in its digital database.The basic Google book link is found at: https://books.google.com/ . The "advanced" interface allowing more specific searches is found at: https://books.google.com/advanced_book_search Books are provided either by publishers and authors through the Google Books Partner Program, or by Google's library partners through the Library Project. Additionally, Google has partnered with a number of magazine publishers to digitize their archives. The Publisher Program was first known as Google Print when it was introduced at the
Frankfurt Book Fair The Frankfurt Book Fair (German: Frankfurter Buchmesse, FBM) is the world's largest trade fair for books, based on the number of publishing companies represented. It is considered to be the most important book fair in the world for internationa ...
in October 2004. The Google Books Library Project, which scans works in the collections of library partners and adds them to the digital inventory, was announced in December 2004. The Google Books initiative has been hailed for its potential to offer unprecedented access to what may become the largest online body of human knowledge and promoting the
democratization of knowledge The democratization of knowledge is the acquisition and spread of knowledge amongst a wider part of the population, not just privileged elites such as clergy and academics. Libraries, in particular public libraries, and modern digital technolog ...
.Malte Herwig, "Google's Total Library"
, ''Spiegel Online International'', March 28, 2007.
However, it has also been criticized for potential copyright violations, and lack of editing to correct the many errors introduced into the scanned texts by the OCR process. , Google celebrated 15 years of Google Books and provided the number of scanned books as more than 40 million titles. Google estimated in 2010 that there were about 130 million distinct titles in the world,
PC World
and stated that it intended to scan all of them. However, the scanning process in American academic libraries has slowed since the aughts. Google Book's scanning efforts have been subject to litigation, including ''
Authors Guild v. Google ''Authors Guild v. Google'' 721 F.3d 132 (2d Cir. 2015) was a copyright case heard in the United States District Court for the Southern District of New York, and on appeal to the United States Court of Appeals for the Second Circuit between 2005 ...
'', a class-action lawsuit in the United States, decided in Google's favor (see below). This was a major case that came close to changing copyright practices for
orphan work An orphan work is a copyright-protected work for which rightsholders are positively indeterminate or uncontactable. Sometimes the names of the originators or rightsholders are known, yet it is impossible to contact them because additional details ...
s in the United States.


Details

Results from Google Books show up in both the universal
Google Search Google Search (also known simply as Google) is a search engine provided by Google. Handling more than 3.5 billion searches per day, it has a 92% share of the global search engine market. It is also the most-visited website in the world. The ...
and in the dedicated Google Books search website (''books.google.com''). In response to search queries, Google Books allows users to view full pages from books in which the search terms appear if the book is out of copyright or if the copyright owner has given permission. If Google believes the book is still under copyright, a user sees "snippets" of text around the queried search terms. All instances of the search terms in the book text appear with a yellow highlight. The four access levels used on Google Books are: * Full view: Books in the
public domain The public domain (PD) consists of all the creative work A creative work is a manifestation of creative effort including fine artwork (sculpture, paintings, drawing, sketching, performance art), dance, writing (literature), filmmaking, ...
are available for "full view" and can be downloaded for free. In-print books acquired through the Partner Program are also available for full view if the publisher has given permission, although this is rare. * Preview: For in-print books where permission has been granted, the number of viewable pages is limited to a "preview" set by a variety of access restrictions and security measures, some based on user-tracking. Usually, the publisher can set the percentage of the book available for preview. Users are restricted from copying, downloading or printing book previews. A watermark reading "Copyrighted material" appears at the bottom of pages. All books acquired through the Partner Program are available for preview. * Snippet view: A "snippet view" – two to three lines of text surrounding the queried search term – is displayed in cases where Google does not have permission of the copyright owner to display a preview. This could be because Google cannot identify the owner or the owner declined permission. If a search term appears many times in a book, Google displays no more than three snippets, thus preventing the user from viewing too much of the book. Also, Google does not display any snippets for certain reference books, such as dictionaries, where the display of even snippets can harm the market for the work. Google maintains that no permission is required under copyright law to display the snippet view. * No preview: Google also displays search results for books that have not been digitized. As these books have not been scanned, their text is not searchable and only the
metadata Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive metadata – the descriptive ...
such as the title, author, publisher, number of pages, ISBN, subject and copyright information, and in some cases, a table of contents and book summary is available. In effect, this is similar to an online library card catalog. In response to criticism from groups such as the
American Association of Publishers The Association of American Publishers (AAP) is the national trade association of the American book publishing industry. AAP lobbies for book, journal, and education publishers in the United States. AAP members include most of the major commercial ...
and the
Authors Guild The Authors Guild is America's oldest and largest professional organization for writers and provides advocacy on issues of free expression and copyright protection. Since its founding in 1912 as the Authors League of America, it has counted among ...
, Google announced an
opt-out The term opt-out refers to several methods by which individuals can avoid receiving unsolicited product or service information. This option is usually associated with direct marketing campaigns such as e-mail marketing or direct mail. A list of thos ...
policy in August 2005, through which copyright owners could provide a list of titles that they do not want scanned, and the request would be respected. The company also stated that it would not scan any in-copyright books between August and 1 November 2005, to provide the owners with the opportunity to decide which books to exclude from the Project. Thus, copyright owners have three choices with respect to any work: # It can participate in the Partner Program to make a book available for preview or full view, in which case it would share revenue derived from the display of pages from the work in response to user queries. # It can let Google scan the book under the Library Project and display snippets in response to user queries. # It can opt out of the Library Project, in which case Google will not scan the book. If the book has already been scanned, Google will reset its access level as 'No preview'. Most scanned works are no longer in print or commercially available. In addition to procuring books from libraries, Google also obtains books from its publisher partners, through the "Partner Program" – designed to help publishers and authors promote their books. Publishers and authors submit either a digital copy of their book in
EPUB EPUB is an e-book file format that uses the ".epub" file extension. The term is short for ''electronic publication'' and is sometimes styled ''ePub''. EPUB is supported by many e-readers, and compatible software is available for most smartphones ...
or
PDF Portable Document Format (PDF), standardized as ISO 32000, is a file format developed by Adobe in 1992 to present documents, including text formatting and images, in a manner independent of application software, hardware, and operating systems. ...
format, or a print copy to Google, which is made available on Google Books for preview. The publisher can control the percentage of the book available for preview, with the minimum being 20%. They can also choose to make the book fully viewable, and even allow users to download a PDF copy. Books can also be made available for sale on Google Play. Unlike the Library Project, this does not raise any copyright concerns as it is conducted pursuant to an agreement with the publisher. The publisher can choose to withdraw from the agreement at any time. For many books, Google Books displays the original page numbers. However,
Tim Parks Timothy Harold Parks (born 19 December 1954) is a British novelist, translator, author and professor of literature. Career He is the author of eighteen novels (notably ''Europa'', which was shortlisted for the Booker Prize in 1997). His first ...
, writing in ''The New York Review of Books'' in 2014, noted that Google had stopped providing page numbers for many recent publications (likely the ones acquired through the Partner Program) "presumably in alliance with the publishers, in order to force those of us who need to prepare footnotes to buy paper editions."


Scanning of books

The project began in 2002 under the codename Project Ocean. Google co-founder
Larry Page Lawrence Edward Page (born March 26, 1973) is an American business magnate, computer scientist and internet entrepreneur. He is best known for co-founding Google with Sergey Brin. Page was the chief executive officer of Google from 1997 unt ...
had always had an interest in digitizing books. When he and
Marissa Mayer Marissa Ann Mayer (; born May 30, 1975) is an American businesswoman and investor. She is an information technology executive, and co-founder of Sunshine Contacts. Mayer formerly served as the president and chief executive officer of Yahoo!, a p ...
began experimenting with
book scanning Book scanning or book digitization (also: magazine scanning or magazine digitization) is the process of converting physical books and magazines into digital media such as images, electronic text, or electronic books (e-books) by using an imag ...
in 2002, it took 40 minutes for them to digitize a 300-page book. But soon after the technology had been developed to the extent that scanning operators could scan up to 6000 pages an hour. Google established designated scanning centers to which books were transported by trucks. The stations could digitize at the rate of 1,000 pages per hour. The books were placed in a custom-built mechanical cradle that adjusted the book spine in place while an array of lights and optical instruments scanned the two open pages. Each page would have two cameras directed at it capturing the image, while a range finder
LIDAR Lidar (, also LIDAR, or LiDAR; sometimes LADAR) is a method for determining ranges (variable distance) by targeting an object or a surface with a laser and measuring the time for the reflected light to return to the receiver. It can also be ...
overlaid a three-dimensional laser grid on the book's surface to capture the curvature of the paper. A human operator would turn the pages by hand, using a foot pedal to take the photographs. With no need to flatten the pages or align them perfectly, Google's system not only reached a remarkable efficiency and speed but also helped protect the fragile collections from being over-handled. Afterwards, the crude images went through three levels of processing: first, de-warping algorithms used the LIDAR data fix the pages' curvature. Then,
optical character recognition Optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scen ...
(OCR) software transformed the raw images into text, and, lastly, another round of algorithms extracted page numbers, footnotes, illustrations and diagrams. Many of the books are scanned using a customized
Elphel Elphel, Inc. designs and manufactures open hardware and free software cameras. The company was founded in 2001 by Russian physicist Andrey Filippov, who emigrated to the US in 1995. Elphel cameras have been used to capture images for Google Stre ...
323 camera at a rate of 1,000 pages per hour. A
patent A patent is a type of intellectual property that gives its owner the legal right to exclude others from making, using, or selling an invention for a limited period of time in exchange for publishing an enabling disclosure of the invention."A p ...
awarded to Google in 2009 revealed that Google had come up with an innovative system for scanning books that uses two cameras and infrared light to automatically correct for the curvature of pages in a book. By constructing a 3D model of each page and then "de-warping" it, Google is able to present flat-looking pages without having to really make the pages flat, which requires the use of destructive methods such as unbinding or glass plates to individually flatten each page, which is inefficient for large scale scanning. Google decided to omit color information in favour of better spatial resolution, as most out-of-copyright books at the time did not contain colors. Each page image was passed through algorithms that distinguished the text and illustration regions. Text regions were then processed via OCR to enable full-text searching. Google expended considerable resources in coming up with optimal compression techniques, aiming for high image quality while keeping the file sizes minimal to enable access by internet users with low bandwidth.


Website functionality

For each work, Google Books automatically generates an overview page. This page displays information extracted from the book—its publishing details, a high frequency word map, the table of contents—as well as secondary material, such as summaries, reader reviews (not readable in the mobile version of the website), and links to other relevant texts. A visitor to the page, for instance, might see a list of books that share a similar genre and theme, or they might see a list of current scholarship on the book. This content, moreover, offers interactive possibilities for users signed into their
Google account A Google Account is a user account that is required for access, authentication and authorization to certain online Google services. It is also often used as single sign on for third party services. Usage A Google Account is required for Gmail, ...
. They can export the bibliographic data and
citation A citation is a reference to a source. More precisely, a citation is an abbreviated alphanumeric expression embedded in the body of an intellectual work that denotes an entry in the bibliographic references section of the work for the purpose of ...
s in standard formats, write their own reviews, add it to their library to be tagged, organized, and shared with other people. Thus, Google Books collects these more interpretive elements from a range of sources, including the users, third-party sites like
Goodreads Goodreads is an American social cataloging website and a subsidiary of Amazon that allows individuals to search its database of books, annotations, quotes, and reviews. Users can sign up and register books to generate library catalogs and read ...
, and often the book's author and publisher. In fact, to encourage authors to upload their own books, Google has added several functionalities to the website. The authors can allow visitors to download their ebook for free, or they can set their own purchase price. They can change the price back and forth, offering discounts whenever it suits them. Also, if a book's author chooses to add an
ISBN The International Standard Book Number (ISBN) is a numeric commercial book identifier that is intended to be unique. Publishers purchase ISBNs from an affiliate of the International ISBN Agency. An ISBN is assigned to each separate edition and ...
,
LCCN The Library of Congress Control Number (LCCN) is a serially based system of numbering cataloged records in the Library of Congress, in the United States. It is not related to the contents of any book, and should not be confused with Library of ...
or
OCLC OCLC, Inc., doing business as OCLC, See also: is an American nonprofit cooperative organization "that provides shared technology services, original research, and community programs for its membership and the library community at large". It was ...
record number, the service will update the book's url to include it. Then, the author can set a specific page as the link's anchor. This option makes their book more easily discoverable.


Ngram Viewer

The Ngram Viewer is a service connected to Google Books that graphs the frequency of word usage across their book collection. The service is important for historians and linguists as it can provide an inside look into human culture through word use throughout time periods. This program has fallen under criticism because of errors in the metadata used in the program.


Content issues and criticism

The project has received criticism that its stated aim of preserving orphaned and out-of-print works is at risk due to scanned data having errors and such problems not being solved. Users can report errors in Google scanned books at
support.google.com/books/partner/troubleshooter/2983879
'.


Scanning errors

The scanning process is subject to errors. For example, some pages may be unreadable, upside down, or in the wrong order. Scholars have even reported crumpled pages, obscuring thumbs and fingers, and smeared or blurry images. On this issue, a declaration from Google at the end of scanned books says: As of 2009, Google stated that they would start using
reCAPTCHA reCAPTCHA is a CAPTCHA system that enables web hosts to distinguish between human and automated access to websites. The original version asked users to decipher hard to read text or match images. Version 2 also asked users to decipher text or ...
to help fix the errors found in Google Book scans. This method would only improve scanned words that are hard to recognize because of the scanning process and cannot solve errors such as turned pages or blocked words. Scanning errors have inspired works of art such as published collections of anomalous pages and a
Tumblr Tumblr (stylized as tumblr; pronounced "tumbler") is an American microblogging and social networking website founded by David Karp in 2007 and currently owned by Automattic. The service allows users to post multimedia and other content to a sho ...
blog.


Errors in metadata

Scholars have frequently reported rampant errors in the
metadata Metadata is "data that provides information about other data", but not the content of the data, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive metadata – the descriptive ...
information on Google Books – including misattributed authors and erroneous dates of publication.
Geoffrey Nunberg Geoffrey Nunberg (June 1, 1945– August 11, 2020) was an American lexical semantician and author. In 2001 he received the Linguistics, Language, and the Public Interest Award from the Linguistic Society of America for his contributions to Natio ...
, a linguist researching on the changes in word usage over time noticed that a search for books published before 1950 and containing the word "internet" turned up an unlikely 527 results. Woody Allen is mentioned in 325 books ostensibly published before he was born. Google responded to Nunberg by blaming the bulk of errors on the outside contractors. Other metadata errors reported include publication dates before the author's birth (e.g. 182 works by Charles Dickens prior to his birth in 1812); incorrect subject classifications (an edition of ''Moby Dick'' found under "computers", a biography of Mae West classified under "religion"), conflicting classifications (10 editions of Whitman's ''Leaves of Grass'' all classified as both "fiction" and "nonfiction"), incorrectly spelled titles, authors, and publishers (''Moby Dick: or the White "Wall"''), and metadata for one book incorrectly appended to a completely different book (the metadata for an 1818 mathematical work leads to a 1963 romance novel). Metadata errors based on incorrect scanned dates makes research using the Google Books Project database difficult. Google has shown only limited interest in cleaning up these errors.


Language issues

Some European politicians and intellectuals have criticized Google's effort on
linguistic imperialism Linguistic imperialism or language imperialism is occasionally defined as "the transfer of a dominant language to other people". This language "transfer" (or rather unilateral imposition) comes about because of imperialism. The transfer is consid ...
grounds. They argue that because the vast majority of books proposed to be scanned are in English, it will result in disproportionate representation of natural languages in the digital world. German, Russian, French, and Spanish, for instance, are popular languages in scholarship. The disproportionate online emphasis on English, however, could shape access to historical scholarship, and, ultimately, the growth and direction of future scholarship. Among these critics is
Jean-Noël Jeanneney Jean-Noël Jeanneney (born 2 April 1942, in Grenoble) is a French historian and politician. He is the son of Jean-Marcel Jeanneney and the grandson of Jules Jeanneney, both important figures in French politics. Education After his secondary schoo ...
, the former president of the ''
Bibliothèque nationale de France The Bibliothèque nationale de France (, 'National Library of France'; BnF) is the national library of France, located in Paris on two main sites known respectively as ''Richelieu'' and ''François-Mitterrand''. It is the national repository ...
''.


Google Books versus Google Scholar

While Google Books has digitized large numbers of journal back issues, its scans do not include the metadata required for identifying specific articles in specific issues. This has led the makers of
Google Scholar Google Scholar is a freely accessible web search engine that indexes the full text or metadata of scholarly literature across an array of publishing formats and disciplines. Released in beta in November 2004, the Google Scholar index includes p ...
to start their own program to digitize and host older journal articles (in agreement with their publishers).


Library partners

The Google Books Library Project is aimed at scanning and making searchable the collections of several major research
libraries A library is a collection of materials, books or media that are accessible for use and not just for display purposes. A library provides physical (hard copies) or digital access (soft copies) materials, and may be a physical location or a vir ...
. Along with
bibliographic Bibliography (from and ), as a discipline, is traditionally the academic study of books as physical, cultural objects; in this sense, it is also known as bibliology (from ). English author and bibliographer John Carter describes ''bibliography ...
information, snippets of text from a book are often viewable. If a book is out of
copyright A copyright is a type of intellectual property that gives its owner the exclusive right to copy, distribute, adapt, display, and perform a creative work, usually for a limited time. The creative work may be in a literary, artistic, education ...
and in the public domain, the book is fully available to read or
download In computer networks, download means to ''receive'' data from a remote system, typically a server such as a web server, an FTP server, an email server, or other similar system. This contrasts with uploading, where data is ''sent to'' a remote s ...
. In-copyright books scanned through the Library Project are made available on Google Books for snippet view. Regarding the quality of scans, Google acknowledges that they are "not always of sufficiently high quality" to be offered for sale on Google Play. Also, because of supposed technical constraints, Google does not replace scans with higher quality versions that may be provided by the publishers. The project is the subject of the ''
Authors Guild v. Google ''Authors Guild v. Google'' 721 F.3d 132 (2d Cir. 2015) was a copyright case heard in the United States District Court for the Southern District of New York, and on appeal to the United States Court of Appeals for the Second Circuit between 2005 ...
'' lawsuit, filed in 2005 and ruled in favor of Google in 2013, and again, on appeal, in 2015. Copyright owners can claim the rights for a scanned book and make it available for preview or full view (by "transferring" it to their Partner Program account), or request Google to prevent the book text from being searched. The number of institutions participating in the Library Project has grown since its inception.


Initial partners

*
Harvard University Harvard University is a private Ivy League research university in Cambridge, Massachusetts. Founded in 1636 as Harvard College and named for its first benefactor, the Puritan clergyman John Harvard, it is the oldest institution of higher le ...
,
Harvard University Library Harvard Library is the umbrella organization for Harvard University's libraries and services. It is the oldest library system in the United States and both the largest academic library and largest private library in the world. Its collection ...
*: The Harvard University Library and Google conducted a pilot throughout 2005. The project continued, with the aim of increasing online access to the holdings of the Harvard University Library, which includes more than 15.8 million volumes. While physical access to Harvard's library materials is generally restricted to current Harvard students, faculty, and researchers, or to scholars who can come to Cambridge, the Harvard-Google Project has been designed to enable both members of the Harvard community and users everywhere to discover works in the Harvard collection. *
University of Michigan , mottoeng = "Arts, Knowledge, Truth" , former_names = Catholepistemiad, or University of Michigania (1817–1821) , budget = $10.3 billion (2021) , endowment = $17 billion (2021)As o ...
,
University of Michigan Library The University of Michigan Library is the academic library system of the University of Michigan. The university's 38 constituent and affiliated libraries together make it the second largest research library by number of volumes in the United State ...
*: As of March 2012, 5.5 million volumes were scanned. *
New York Public Library The New York Public Library (NYPL) is a public library system in New York City. With nearly 53 million items and 92 locations, the New York Public Library is the second largest public library in the United States (behind the Library of Congress ...
*: In this pilot program, NYPL is working with Google to offer a collection of its public domain books, which will be scanned in their entirety and made available for free to the public online. Users will be able to search and browse the full text of these works. When the scanning process is complete, the books may be accessed from both The New York Public Library's website and from the Google search engine. *
University of Oxford , mottoeng = The Lord is my light , established = , endowment = £6.1 billion (including colleges) (2019) , budget = £2.145 billion (2019–20) , chancellor ...
,
Bodleian Library The Bodleian Library () is the main research library of the University of Oxford, and is one of the oldest libraries in Europe. It derives its name from its founder, Sir Thomas Bodley. With over 13 million printed items, it is the second- ...
*
Stanford University Stanford University, officially Leland Stanford Junior University, is a private research university in Stanford, California. The campus occupies , among the largest in the United States, and enrolls over 17,000 students. Stanford is consider ...
,
Stanford University Libraries The Stanford University Libraries (SUL), formerly known as "Stanford University Libraries and Academic Information Resources" ("SULAIR"), is the library system of Stanford University in California. It encompasses more than 24 libraries in all. Sev ...
( SULAIR)


Additional partners

Other institutional partners have joined the project since the partnership was first announced: *
Austrian National Library The Austrian National Library (german: Österreichische Nationalbibliothek) is the largest library in Austria, with more than 12 million items in its various collections. The library is located in the Neue Burg Wing of the Hofburg in center of V ...
*
Bavarian State Library The Bavarian State Library (german: Bayerische Staatsbibliothek, abbreviated BSB, called ''Bibliotheca Regia Monacensis'' before 1919) in Munich is the central " Landesbibliothek", i. e. the state library of the Free State of Bavaria, the bigg ...
*
Bibliothèque municipale de Lyon The Bibliothèque municipale de Lyon is a library in Lyon, France. In addition to providing standard library services it also hosts a variety of special collections, in particular in the fields of photography, Lyon and the Rhône-Alpes Rhône ...
*
Big Ten Academic Alliance The Big Ten Academic Alliance (BTAA), formerly the Committee on Institutional Cooperation (CIC), is the academic consortium of the universities in the Big Ten Conference. The consortium was renamed on June 29, 2016. Member universities The B ...
*
Columbia University Columbia University (also known as Columbia, and officially as Columbia University in the City of New York) is a private research university in New York City. Established in 1754 as King's College on the grounds of Trinity Church in Manhatt ...
,
Columbia University Library System Columbia University Libraries is the library system of Columbia University and one of the largest academic library systems in North America. With 15.0 million volumes and over 160,000 journals and serials, as well as extensive electronic resources ...
*
Complutense University of Madrid The Complutense University of Madrid ( es, Universidad Complutense de Madrid; UCM, links=no, ''Universidad de Madrid'', ''Universidad Central de Madrid''; la, Universitas Complutensis Matritensis, links=no) is a public research university loca ...
*
Cornell University Cornell University is a private statutory land-grant research university based in Ithaca, New York. It is a member of the Ivy League. Founded in 1865 by Ezra Cornell and Andrew Dickson White, Cornell was founded with the intention to teach an ...
,
Cornell University Library The Cornell University Library is the library system of Cornell University. As of 2014, it holds over 8 million printed volumes and over a million ebooks. More than 90 percent of its current 120,000 Periodical literature, periodical titles are ...
*
Ghent University Ghent University ( nl, Universiteit Gent, abbreviated as UGent) is a public research university located in Ghent, Belgium. Established before the state of Belgium itself, the university was founded by the Dutch King William I in 1817, when the ...
,
Ghent University Library Ghent University Library ( nl, Universiteitsbibliotheek Gent) is located in the city of Ghent, Belgium. It serves the university community of students and scholarly researchers. History After Ghent University was founded in 1817, books confiscated ...
/
Boekentoren The Boekentoren (Dutch for ''Book Tower'') is a famous building located in Ghent, Belgium, designed by the Belgian architect Henry van de Velde. It is part of the Ghent University Library and currently houses 3 million books. The Boekentoren is di ...
*
Keio University , mottoeng = The pen is mightier than the sword , type = Private research coeducational higher education institution , established = 1858 , founder = Yukichi Fukuzawa , endowmen ...
,
Keio Media Centers (Libraries) Keio Media Centers is the English name used by Keio University to describe its library system. The Media Centers (libraries) on the various Keio campuses are important information resources for students, faculty, and researchers. Together, they co ...
*
National Library of Catalonia The Library of Catalonia ( ca, Biblioteca de Catalunya, ) is the Catalan national library, located in Barcelona, Catalonia, Spain. The primary mission of the Library of Catalonia is to collect, preserve, and spread Catalan bibliographic producti ...
, ''
Biblioteca de Catalunya The Library of Catalonia ( ca, Biblioteca de Catalunya, ) is the Catalan national library, located in Barcelona, Catalonia, Spain. The primary mission of the Library of Catalonia is to collect, preserve, and spread Catalan bibliographic producti ...
'' *
Princeton University Princeton University is a private university, private research university in Princeton, New Jersey. Founded in 1746 in Elizabeth, New Jersey, Elizabeth as the College of New Jersey, Princeton is the List of Colonial Colleges, fourth-oldest ins ...
,
Princeton University Library Princeton University Library is the main library system of Princeton University. With holdings of more than 7 million books, 6 million microforms, and 48,000 linear feet of manuscripts, it is among the largest libraries in the world by number of ...
*
University of California The University of California (UC) is a public land-grant research university system in the U.S. state of California. The system is composed of the campuses at Berkeley, Davis, Irvine, Los Angeles, Merced, Riverside, San Diego, San Francisco, ...
,
California Digital Library The California Digital Library (CDL) was founded by the University of California in 1997. Under the leadership of then UC President Richard C. Atkinson, the CDL's original mission was to forge a better system for scholarly information management a ...
*
University of Lausanne The University of Lausanne (UNIL; french: links=no, Université de Lausanne) in Lausanne, Switzerland was founded in 1537 as a school of Protestant theology, before being made a university in 1890. The university is the second oldest in Switzer ...
,
Cantonal and University Library of Lausanne The Cantonal and University Library of Lausanne (''Bibliothèque cantonale et universitaire de Lausanne'', BCU) was founded in the 16th century and became one of the most important public libraries in Switzerland. History The University of L ...
*
University of Mysore The University of Mysore is a public state university in Mysore, Karnataka, India. The university was founded during the reign of Krishnaraja Wadiyar IV, the Maharaja of Mysore. The university is recognised by the University Grants Commission ...
,
Mysore University Library The Mysore University Library serves the academic community of the University of Mysore at the located in Mysore, Hassan and Mandya. The Library is the largest and also oldest among the University Libraries in the southern Indian State of Karnat ...
*: The partnership was for digitizing 800,000 texts, including manuscripts written in palm leaves dating back to 8th century. *
University of Texas at Austin The University of Texas at Austin (UT Austin, UT, or Texas) is a public research university in Austin, Texas. It was founded in 1883 and is the oldest institution in the University of Texas System. With 40,916 undergraduate students, 11,075 ...
, University of Texas Libraries *:The partnership was for digitizing the library's Latin American collection – about half a million volumes. *
University of Virginia The University of Virginia (UVA) is a Public university#United States, public research university in Charlottesville, Virginia. Founded in 1819 by Thomas Jefferson, the university is ranked among the top academic institutions in the United S ...
,
University of Virginia Library The University of Virginia (UVA) is a public research university in Charlottesville, Virginia. Founded in 1819 by Thomas Jefferson, the university is ranked among the top academic institutions in the United States, with highly selective adm ...
*
University of Wisconsin–Madison A university () is an educational institution, institution of higher education, higher (or Tertiary education, tertiary) education and research which awards academic degrees in several Discipline (academia), academic disciplines. Universities ty ...
, University of Wisconsin Libraries *:As of March 2012, about 600,000 volumes had been scanned.


History

2002: A group of team members at Google officially launch the "secret 'books' project." Google founders
Sergey Brin Sergey Mikhailovich Brin (russian: link=no, Сергей Михайлович Брин; born August 21, 1973) is an American business magnate, computer scientist, and internet entrepreneur, who co-founded Google with Larry Page. Brin was the ...
and
Larry Page Lawrence Edward Page (born March 26, 1973) is an American business magnate, computer scientist and internet entrepreneur. He is best known for co-founding Google with Sergey Brin. Page was the chief executive officer of Google from 1997 unt ...
came up with the idea that later became Google Books while still graduate students at Stanford in 1996. The history page on the Google Books website describes their initial vision for this project: "in a future world in which vast collections of books are digitized, people would use a '
web crawler A Web crawler, sometimes called a spider or spiderbot and often shortened to crawler, is an Internet bot that systematically browses the World Wide Web and that is typically operated by search engines for the purpose of Web indexing (''web spid ...
' to index the books' content and analyze the connections between them, determining any given book's relevance and usefulness by tracking the number and quality of citations from other books." This team visited the sites of some of the larger digitization efforts at that time including the Library of Congress's American Memory Project,
Project Gutenberg Project Gutenberg (PG) is a Virtual volunteering, volunteer effort to digitize and archive cultural works, as well as to "encourage the creation and distribution of eBooks." It was founded in 1971 by American writer Michael S. Hart and is the ...
, and the Universal Library to find out how they work, as well as the University of Michigan, Page's alma mater, and the base for such digitization projects as
JSTOR JSTOR (; short for ''Journal Storage'') is a digital library founded in 1995 in New York City. Originally containing digitized back issues of academic journals, it now encompasses books and other primary sources as well as current issues of j ...
and Making of America. In a conversation with the at that time University President
Mary Sue Coleman Mary Sue Wilson Coleman (born October 2, 1943) is an American chemist and academic administrator who served as the president of the University of Iowa from 1995 to 2002, the 13th president of the University of Michigan from 2002 to 2014, and as ...
, when Page found out that the university's current estimate for scanning all the library's volumes was 1,000 years, Page reportedly told Coleman that he "believes Google can help make it happen in six." 2003: The team works to develop a high-speed scanning process as well as software for resolving issues in odd type sizes, unusual fonts, and "other unexpected peculiarities." December 2004: Google signaled an extension to its Google Print initiative known as the Google Print Library Project.O'Sullivan, Joseph and Adam Smith
"All booked up,"
''Googleblog.'' December 14, 2004.
Google announced partnerships with several high-profile university and public libraries, including the
University of Michigan , mottoeng = "Arts, Knowledge, Truth" , former_names = Catholepistemiad, or University of Michigania (1817–1821) , budget = $10.3 billion (2021) , endowment = $17 billion (2021)As o ...
,
Harvard Harvard University is a private Ivy League research university in Cambridge, Massachusetts. Founded in 1636 as Harvard College and named for its first benefactor, the Puritan clergyman John Harvard, it is the oldest institution of higher le ...
(
Harvard University Library Harvard Library is the umbrella organization for Harvard University's libraries and services. It is the oldest library system in the United States and both the largest academic library and largest private library in the world. Its collection ...
),
Stanford Stanford University, officially Leland Stanford Junior University, is a private research university in Stanford, California. The campus occupies , among the largest in the United States, and enrolls over 17,000 students. Stanford is considere ...
(
Green Library The Cecil H. Green Library (commonly known as Green Library) is the main library on the Stanford University campus and is part of the SUL system. It is named for Cecil H. Green. Green Library houses 4 million volumes, most of which are relate ...
),
Oxford Oxford () is a city in England. It is the county town and only city of Oxfordshire. In 2020, its population was estimated at 151,584. It is north-west of London, south-east of Birmingham and north-east of Bristol. The city is home to the ...
(
Bodleian Library The Bodleian Library () is the main research library of the University of Oxford, and is one of the oldest libraries in Europe. It derives its name from its founder, Sir Thomas Bodley. With over 13 million printed items, it is the second- ...
), and the
New York Public Library The New York Public Library (NYPL) is a public library system in New York City. With nearly 53 million items and 92 locations, the New York Public Library is the second largest public library in the United States (behind the Library of Congress ...
. According to press releases and university librarians, Google planned to digitize and make available through its Google Books service approximately 15 million volumes within a decade. The announcement soon triggered controversy, as publisher and author associations challenged Google's plans to digitize, not just books in the public domain, but also titles still under copyright. September–October 2005: Two lawsuits against Google charge that the company has not respected
copyright A copyright is a type of intellectual property that gives its owner the exclusive right to copy, distribute, adapt, display, and perform a creative work, usually for a limited time. The creative work may be in a literary, artistic, education ...
s and has failed to properly compensate authors and publishers. One is a class action suit on behalf of authors (Authors Guild v. Google, September 20, 2005) and the other is a civil lawsuit brought by five large publishers and the
Association of American Publishers The Association of American Publishers (AAP) is the national trade association of the American book publishing industry. AAP lobbies for book, journal, and education publishers in the United States. AAP members include most of the major commercia ...
. (
McGraw Hill v. Google McGraw or MacGraw may refer to: * McGraw (surname) * McGraw, New York * McGraw-Hill Education, a publishing and education corporation See also * McGrawville, New York McGrawville (also New Hudson) is a former Hamlet (New York), hamlet in the t ...
, October 19, 2005)Copyright infringement suits against Google and their settlement: November 2005: Google changed the name of this service from Google Print to Google Book Search. Its program enabling publishers and authors to include their books in the service was renamed Google Books Partner Program, and the partnership with libraries became
Google Books Library Project Google Books (previously known as Google Book Search, Google Print, and by its code-name Project Ocean) is a service from Google Inc. that searches the full text of books and magazines that Google has scanned, converted to text using optical c ...
. 2006: Google added a "download a pdf" button to all its out-of-copyright, public domain books. It also added a new browsing interface along with new "About this Book" pages. August 2006: The
University of California System The University of California (UC) is a public land-grant research university system in the U.S. state of California. The system is composed of the campuses at Berkeley, Davis, Irvine, Los Angeles, Merced, Riverside, San Diego, San Francisco, ...
announced that it would join the Books digitization project. This includes a portion of the 34 million volumes within the approximately 100 libraries managed by the System. September 2006: The
Complutense University of Madrid The Complutense University of Madrid ( es, Universidad Complutense de Madrid; UCM, links=no, ''Universidad de Madrid'', ''Universidad Central de Madrid''; la, Universitas Complutensis Matritensis, links=no) is a public research university loca ...
became the first Spanish-language library to join the Google Books Library Project. October 2006: The
University of Wisconsin–Madison A university () is an educational institution, institution of higher education, higher (or Tertiary education, tertiary) education and research which awards academic degrees in several Discipline (academia), academic disciplines. Universities ty ...
announced that it would join the Book Search digitization project along with the
Wisconsin Historical Society The Wisconsin Historical Society (officially the State Historical Society of Wisconsin) is simultaneously a state agency and a private membership organization whose purpose is to maintain, promote and spread knowledge relating to the history of N ...
Library. Combined, the libraries have 7.2 million holdings. November 2006: The
University of Virginia The University of Virginia (UVA) is a Public university#United States, public research university in Charlottesville, Virginia. Founded in 1819 by Thomas Jefferson, the university is ranked among the top academic institutions in the United S ...
joined the project. Its libraries contain more than five million volumes and more than 17 million manuscripts, rare books and archives. January 2007: The
University of Texas at Austin The University of Texas at Austin (UT Austin, UT, or Texas) is a public research university in Austin, Texas. It was founded in 1883 and is the oldest institution in the University of Texas System. With 40,916 undergraduate students, 11,075 ...
announced that it would join the Book Search digitization project. At least one million volumes would be digitized from the university's 13 library locations. March 2007: The
Bavarian State Library The Bavarian State Library (german: Bayerische Staatsbibliothek, abbreviated BSB, called ''Bibliotheca Regia Monacensis'' before 1919) in Munich is the central " Landesbibliothek", i. e. the state library of the Free State of Bavaria, the bigg ...
announced a partnership with Google to scan more than a million public domain and out-of-print works in German as well as English, French, Italian, Latin, and Spanish. May 2007: A book digitizing project partnership was announced jointly by Google and the
Cantonal and University Library of Lausanne The Cantonal and University Library of Lausanne (''Bibliothèque cantonale et universitaire de Lausanne'', BCU) was founded in the 16th century and became one of the most important public libraries in Switzerland. History The University of L ...
. May 2007: The
Boekentoren The Boekentoren (Dutch for ''Book Tower'') is a famous building located in Ghent, Belgium, designed by the Belgian architect Henry van de Velde. It is part of the Ghent University Library and currently houses 3 million books. The Boekentoren is di ...
Library of
Ghent University Ghent University ( nl, Universiteit Gent, abbreviated as UGent) is a public research university located in Ghent, Belgium. Established before the state of Belgium itself, the university was founded by the Dutch King William I in 1817, when the ...
announced that it would participate with Google in digitizing and making digitized versions of 19th century books in the French and Dutch languages available online. May 2007: Mysore University announces Google will digitize over 800,000 books and manuscripts–including around 100,000 manuscripts written in Sanskrit or Kannada on both paper and palm leaves. June 2007: The
Committee on Institutional Cooperation The Big Ten Academic Alliance (BTAA), formerly the Committee on Institutional Cooperation (CIC), is the academic consortium of the universities in the Big Ten Conference. The consortium was renamed on June 29, 2016. Member universities The Bi ...
(rebranded as the
Big Ten Academic Alliance The Big Ten Academic Alliance (BTAA), formerly the Committee on Institutional Cooperation (CIC), is the academic consortium of the universities in the Big Ten Conference. The consortium was renamed on June 29, 2016. Member universities The B ...
in 2016) announced that its twelve member libraries would participate in scanning 10 million books over the course of the next six years. July 2007:
Keio University , mottoeng = The pen is mightier than the sword , type = Private research coeducational higher education institution , established = 1858 , founder = Yukichi Fukuzawa , endowmen ...
became Google's first library partner in
Japan Japan ( ja, 日本, or , and formally , ''Nihonkoku'') is an island country in East Asia. It is situated in the northwest Pacific Ocean, and is bordered on the west by the Sea of Japan, while extending from the Sea of Okhotsk in the north ...
with the announcement that they would digitize at least 120,000 public domain books. August 2007: Google announced that it would digitize up to 500,000 both copyrighted and public domain items from
Cornell University Library The Cornell University Library is the library system of Cornell University. As of 2014, it holds over 8 million printed volumes and over a million ebooks. More than 90 percent of its current 120,000 Periodical literature, periodical titles are ...
. Google would also provide a digital copy of all works scanned to be incorporated into the university's own library system. September 2007: Google added a feature that allows users to share snippets of books that are in the public domain. The snippets may appear exactly as they do in the scan of the book, or as plain text. September 2007: Google debuted a new feature called "My Library" which allows users to create personal customized libraries, selections of books that they can label, review, rate, or full-text search. December 2007:
Columbia University Columbia University (also known as Columbia, and officially as Columbia University in the City of New York) is a private research university in New York City. Established in 1754 as King's College on the grounds of Trinity Church in Manhatt ...
was added as a partner in digitizing public domain works. May 2008:
Microsoft Microsoft Corporation is an American multinational technology corporation producing computer software, consumer electronics, personal computers, and related services headquartered at the Microsoft Redmond campus located in Redmond, Washing ...
tapered off and planned to end its scanning project, which had reached 750,000 books and 80 million journal articles. October 2008: A
settlement Settlement may refer to: *Human settlement, a community where people live *Settlement (structural), the distortion or disruption of parts of a building *Closing (real estate), the final step in executing a real estate transaction *Settlement (fina ...
was reached between the publishing industry and Google after two years of negotiation. Google agreed to compensate authors and publishers in exchange for the right to make millions of books available to the public. October 2008: The
HathiTrust HathiTrust Digital Library is a large-scale collaborative repository of digital content from research libraries including content digitized via Google Books and the Internet Archive digitization initiatives, as well as content digitized locally ...
"Shared Digital Repository" (later known as the HathiTrust Digital Library) is launched jointly by the
Committee on Institutional Cooperation The Big Ten Academic Alliance (BTAA), formerly the Committee on Institutional Cooperation (CIC), is the academic consortium of the universities in the Big Ten Conference. The consortium was renamed on June 29, 2016. Member universities The Bi ...
and the 11 university libraries in the
University of California system The University of California (UC) is a public land-grant research university system in the U.S. state of California. The system is composed of the campuses at Berkeley, Davis, Irvine, Los Angeles, Merced, Riverside, San Diego, San Francisco, ...
, all of which were Google partner libraries, in order to archive and provide academic access to books from their collections scanned by Google and others. November 2008: Google reached the 7 million book mark for items scanned by Google and by their publishing partners. 1 million were in full preview mode and 1 million were fully viewable and downloadable public domain works. About five million were
out of print __NOTOC__ An out-of-print (OOP) or out-of-commerce item or work is something that is no longer being published. The term applies to all types of printed matter, visual media, sound recordings, and video recordings. An out-of-print book is a book ...
. December 2008: Google announced the inclusion of magazines in Google Books. Titles include ''
New York Magazine ''New York'' is an American biweekly magazine concerned with life, culture, politics, and style generally, and with a particular emphasis on New York City. Founded by Milton Glaser and Clay Felker in 1968 as a competitor to ''The New Yorker'', ...
'', ''
Ebony Ebony is a dense black/brown hardwood, coming from several species in the genus ''Diospyros'', which also contains the persimmons. Unlike most woods, ebony is dense enough to sink in water. It is finely textured and has a mirror finish when pol ...
'', and ''
Popular Mechanics ''Popular Mechanics'' (sometimes PM or PopMech) is a magazine of popular science and technology, featuring automotive, home, outdoor, electronics, science, do-it-yourself, and technology topics. Military topics, aviation and transportation o ...
'' February 2009: Google launched a mobile version of Google Book Search, allowing iPhone and Android phone users to read over 1.5 million public domain works in the US (and over 500,000 outside the US) using a mobile browser. Instead of page images, the plain text of the book is displayed. May 2009: At the annual
BookExpo BookExpo America (commonly referred to within the book publishing industry as BEA) was an annual book trade fair in the United States. BEA is almost always held in a major city over four days in late May and/or early June. Nearly all significant ...
convention in New York, Google signaled its intent to introduce a program that would enable publishers to sell digital versions of their newest books direct to consumers through Google. December 2009: A French court shut down the scanning of copyrighted books published in France, saying this violated copyright laws. It was the first major legal loss for the scanning project. April 2010: Visual artists were not included in the previous lawsuit and settlement, are the plaintiff groups in another lawsuit, and say they intend to bring more than just Google Books under scrutiny. "The new class action," read the statement, "goes beyond Google's Library Project, and includes Google's other systematic and pervasive infringements of the rights of photographers, illustrators and other visual artists." May 2010: It was reported that Google would launch a digital book store called
Google Editions Google Play Books, formerly Google eBooks, is an ebook digital distribution service operated by Google, part of its Google Play product line. Users can purchase and download ebooks and audiobooks from Google Play, which offers over five millio ...
. It would compete with Amazon, Barnes & Noble, Apple and other electronic book retailers with its own e-book store. Unlike others, Google Editions would be completely online and would not require a specific device (such as kindle, Nook, or iPad). June 2010: Google passed 12 million books scanned. August 2010: It was announced that Google intends to scan all known existing 129,864,880 books within a decade, amounting to over 4 billion
digital page Pagination, also known as paging, is the process of dividing a document into discrete pages, either electronic pages or printed pages. In reference to books produced without a computer, pagination can mean the consecutive page numbering to ind ...
s and 2 trillion words in total. December 2010: Google eBooks (Google Editions) was launched in the US. December 2010: Google launched the Ngram Viewer, which collects and graphs data on word usage across its book collection. March 2011: A federal judge rejected the
settlement Settlement may refer to: *Human settlement, a community where people live *Settlement (structural), the distortion or disruption of parts of a building *Closing (real estate), the final step in executing a real estate transaction *Settlement (fina ...
reached between the publishing industry and Google. March 2012: Google passed 20 million books scanned.Howard, Jennife
''Google Begins to Scale Back Its Scanning of Books From University Libraries''
, March 9, 2012
March 2012: Google reached a settlement with publishers. January 2013: The documentary ''
Google and the World Brain ''Google and the World Brain'' is a 2013 documentary movie about the Google Books Library Project directed by Ben Lewis, produced by BBC, Polar Star Films and Arte. The main focus in the plot is on copyright controversy caused by the project that ...
'' was shown at the
Sundance Film Festival The Sundance Film Festival (formerly Utah/US Film Festival, then US Film and Video Festival) is an annual film festival organized by the Sundance Institute. It is the largest independent film festival in the United States, with more than 46,66 ...
. November 2013: Ruling in ''
Authors Guild v. Google ''Authors Guild v. Google'' 721 F.3d 132 (2d Cir. 2015) was a copyright case heard in the United States District Court for the Southern District of New York, and on appeal to the United States Court of Appeals for the Second Circuit between 2005 ...
'', US District Judge
Denny Chin Denny Chin (陳卓光; born April 13, 1954) is a Senior United States circuit judge of the United States Court of Appeals for the Second Circuit, based in New York City. He was a United States District Judge of the United States District Court fo ...
sides with Google, citing fair use. The authors said they would appeal. October 2015: The appeals court sided with Google, declaring that Google did not violate copyright law. According to the New York Times, Google has scanned more than 25 million books. April 2016: The US Supreme Court declined to hear the Authors Guild's appeal, which means the lower court's decision stood, and Google would be allowed to scan library books and display snippets in search results without violating the law.


Status

Google has been quite secretive regarding its plans on the future of the Google Books project. Scanning operations had been slowing down since at least 2012, as confirmed by the librarians at several of Google's partner institutions. At University of Wisconsin, the speed had reduced to less than half of what it was in 2006. However, the librarians have said that the dwindling pace could be a natural result of maturation of the project – initially stacks of books were entirely taken up for scanning whereas now only the titles that had not already been scanned needed to be considered. The company's own Google Books timeline page did not mention anything after 2007 even in 2017, and the Google Books blog was merged into the Google Search blog in 2012. Despite winning the decade-long litigation in 2017, ''
The Atlantic ''The Atlantic'' is an American magazine and multi-platform publisher. It features articles in the fields of politics, foreign affairs, business and the economy, culture and the arts, technology, and science. It was founded in 1857 in Boston, ...
'' has said that Google has "all but shut down its scanning operation." In April 2017, ''
Wired ''Wired'' (stylized as ''WIRED'') is a monthly American magazine, published in print and online editions, that focuses on how emerging technologies affect culture, the economy, and politics. Owned by Condé Nast, it is headquartered in San Fra ...
'' reported that there were only a few Google employees working on the project, and new books were still being scanned, but at a significantly lower rate. It commented that the decade-long legal battle had caused Google to lose its ambition.


Legal issues

Through the project, library books were being digitized somewhat indiscriminately regardless of copyright status, which led to a number of lawsuits against Google. By the end of 2008, Google had reportedly digitized over seven million books, of which only about one million were works in the public domain. Of the rest, one million were in copyright and in print, and five million were in copyright but out of print. In 2005, a group of authors and publishers brought a major class-action lawsuit against Google for infringement on the copyrighted works. Google argued that it was preserving "orphaned works" – books still under copyright, but whose copyright holders could not be located. The
Authors Guild The Authors Guild is America's oldest and largest professional organization for writers and provides advocacy on issues of free expression and copyright protection. Since its founding in 1912 as the Authors League of America, it has counted among ...
and
Association of American Publishers The Association of American Publishers (AAP) is the national trade association of the American book publishing industry. AAP lobbies for book, journal, and education publishers in the United States. AAP members include most of the major commercia ...
separately sued Google in 2005 for its book project, citing "massive
copyright infringement Copyright infringement (at times referred to as piracy) is the use of works protected by copyright without permission for a usage where such permission is required, thereby infringing certain exclusive rights granted to the copyright holder, s ...
." Google countered that its project represented a
fair use Fair use is a doctrine in United States law that permits limited use of copyrighted material without having to first acquire permission from the copyright holder. Fair use is one of the limitations to copyright intended to balance the interests ...
and is the digital age equivalent of a
card catalog A library catalog (or library catalogue in British English) is a register of all bibliographic items found in a library or group of libraries, such as a network of libraries at several locations. A catalog for a group of libraries is also c ...
with every word in the publication indexed. The lawsuits were consolidated, and eventually a settlement was proposed. The settlement received significant criticism on a wide variety of grounds, including antitrust, privacy, and inadequacy of the proposed classes of authors and publishers. The settlement was eventually rejected, and the publishers settled with Google soon after. The Authors Guild continued its case, and in 2011 their proposed class was certified. Google appealed that decision, with a number of amici asserting the inadequacy of the class, and the Second Circuit rejected the
class certification A class action, also known as a class-action lawsuit, class suit, or representative action, is a type of lawsuit where one of the parties is a group of people who are represented collectively by a member or members of that group. The class action ...
in July 2013, remanding the case to the District Court for consideration of Google's
fair use Fair use is a doctrine in United States law that permits limited use of copyrighted material without having to first acquire permission from the copyright holder. Fair use is one of the limitations to copyright intended to balance the interests ...
defense. In 2015 Authors Guild filed another appeal against Google to be considered by the 2nd U.S. Circuit Court of Appeals in New York. Google won the case unanimously based on the argument that they were not showing people the full texts but instead snippets, and they are not allowing people to illegally read the book. In a report, courts stated that they did not infringe on copyright laws, as they were protected under the fair use clause. Authors Guild tried again in 2016 to appeal the decision and this time took their case to be considered by the Supreme Court. The case was rejected, leaving the Second Circuit's decision on the case intact, meaning that Google did not violate copyright laws. This case also set a precedent for other similar cases in regards to fair use laws, as it further clarified the law and expanded it. Such clarification affects other scanning projects similar to Google. Other lawsuits followed the Authors Guild's lead. In 2006 a German lawsuit, previously filed, was withdrawn. In June 2006, Hervé de la Martinière, a French publisher known as La Martinière and
Éditions du Seuil Éditions du Seuil (), also known as ''Le Seuil'', is a French publishing house established in 1935 by Catholic intellectual Jean Plaquevent (1901–1965), and currently owned by La Martinière Groupe. It owes its name to this goal "The ''seuil'' ...
, announced its intention to sue Google France. In 2009, the Paris Civil Court awarded 300,000
EUR The euro (symbol: €; code: EUR) is the official currency of 19 out of the member states of the European Union (EU). This group of states is known as the eurozone or, officially, the euro area, and includes about 340 million citizens . Th ...
(approximately 430,000
USD The United States dollar (symbol: $; code: USD; also abbreviated US$ or U.S. Dollar, to distinguish it from other dollar-denominated currencies; referred to as the dollar, U.S. dollar, American dollar, or colloquially buck) is the official ...
) in damages and interest and ordered Google to pay 10,000 EUR a day until it removes the publisher's books from its database. The court wrote, "Google violated author copyright laws by fully reproducing and making accessible" books that Seuil owns without its permission and that Google "committed acts of breach of copyright, which are of harm to the publishers". Google said it will appeal. Syndicat National de l'Edition, which joined the lawsuit, said Google has scanned about 100,000 French works under copyright. In December 2009, Chinese author
Mian Mian Mian Mian (, born 28 August 1970 in Shanghai) is a Chinese Post 70s Generation writer. She writes on China's once-taboo topics and she is a promoter of Shanghai's local music. Her publications have earned her the reputation as China's literary ...
filed a civil lawsuit for $8,900 against Google for scanning her novel, ''Acid Lovers''. This is the first such lawsuit to be filed against Google in China. Also, in November that year, the China Written Works Copyright Society (CWWCS) accused Google of scanning 18,000 books by 570 Chinese writers without authorization. Google agreed on Nov 20 to provide a list of Chinese books it had scanned, but the company refused to admit having "infringed" copyright laws. In March 2007, Thomas Rubin, associate general counsel for copyright, trademark, and trade secrets at Microsoft, accused Google of violating copyright law with their book search service. Rubin specifically criticized Google's policy of freely copying any work until notified by the copyright holder to stop. Google licensing of public domain works is also an area of concern due to using of
digital watermarking A digital watermark is a kind of marker covertly embedded in a noise-tolerant signal such as audio, video or image data. It is typically used to identify ownership of the copyright of such signal. "Watermarking" is the process of hiding digital inf ...
techniques with the books. Some published works that are in the public domain, such as all works created by the U.S. Federal government, are still treated like other works under copyright, and therefore locked after 1922.


Similar projects

*
Project Gutenberg Project Gutenberg (PG) is a Virtual volunteering, volunteer effort to digitize and archive cultural works, as well as to "encourage the creation and distribution of eBooks." It was founded in 1971 by American writer Michael S. Hart and is the ...
is a volunteer effort to digitize and archive cultural works, to "encourage the creation and distribution of eBooks". It was founded in 1971 by Michael S. Hart and is the oldest digital library. , Project Gutenberg reached 50,000 items in its collection. *
Internet Archive The Internet Archive is an American digital library with the stated mission of "universal access to all knowledge". It provides free public access to collections of digitized materials, including websites, software applications/games, music, ...
is a non-profit which digitizes over 1000 books a day, as well as mirrors books from Google Books and other sources. , it hosted over 2.8 million public domain books, greater than the approximate 1 million public domain books at Google Books.
Open Library Open Library is an online project intended to create "one web page for every book ever published". Created by Aaron Swartz, Brewster Kahle, Alexis Rossi, Anand Chitipothu, and Rebecca Malamud, Open Library is a project of the Internet Archive, ...
, a sister project of Internet Archive, lends 80,000 scanned and purchased commercial ebooks to the visitors of 150 libraries. *
HathiTrust HathiTrust Digital Library is a large-scale collaborative repository of digital content from research libraries including content digitized via Google Books and the Internet Archive digitization initiatives, as well as content digitized locally ...
maintains HathiTrust Digital Library since October 13, 2008, which preserves and provides access to material scanned by Google, some of the Internet Archive books, and some scanned locally by partner institutions. , it includes about 6 million volumes, over 1 million of which are public domain (at least in the US). *
ACLS Humanities E-Book
an online collection of over 5,400 books of high quality in the humanities and related social sciences, accessible through institutional subscription. * Microsoft funded the scanning of 300,000 books to create
Live Search Books Live Search Books was a search service for books launched in December 2006, part of Microsoft's Live Search range of services. Microsoft was working with a number of libraries, including the British Library, to digitize books and make them searcha ...
in late 2006. It ran until May 2008, when the project was abandoned and the books were made freely available on the Internet Archive. * The
National Digital Library of India The National Digital library of India is a virtual repository of learning resources which is not only just a repository with a search/browse facilities but also provides a host of services containing textbooks, articles, videos, audio books, l ...
(NDLI) is a project under Ministry of Human Resource Development, India. The objective is to integrate several national and international digital libraries in one single web-portal. The NDLI provides free of cost access to many books in English and the Indian languages. *
Europeana Europeana is a web portal created by the European Union containing digitised cultural heritage collections of more than 3,000 institutions across Europe. It includes records of over 50 million cultural and scientific artefacts, brought togethe ...
links to roughly 10 million digital objects , including video, photos, paintings, audio, maps, manuscripts, printed books, and newspapers from the past 2,000 years of European history from over 1,000 archives in the European Union. * Gallica from the French National Library links to about 4,000,000 digitized books, newspapers, manuscripts, maps and drawings, etc. Created in 1997, the digital library continues to expand at a rate of about 5000 new documents per month. Since the end of 2008, most of the new scanned documents are available in image and text formats. Most of these documents are written in French. *
Wikisource Wikisource is an online digital library of free-content textual sources on a wiki, operated by the Wikimedia Foundation. Wikisource is the name of the project as a whole and the name for each instance of that project (each instance usually rep ...
*
Runivers Runivers ( rus, Руниверс) is a site devoted to Russian culture and history. Runivers targets Russian speaking readers and those interested in Russian culture and history. Runivers is an online library aimed to provide free access to aut ...


See also

*
A9.com A9.com is a former subsidiary of Amazon that develops search engine and search advertising technology. A9 is based in Palo Alto, California, with teams in Seattle, Bangalore, Beijing, Dublin, Iași, Munich and Tokyo. A9 has development effo ...
,
Amazon.com Amazon.com, Inc. ( ) is an American multinational technology company focusing on e-commerce, cloud computing, online advertising, digital streaming, and artificial intelligence. It has been referred to as "one of the most influential economi ...
's book search *
Book Rights Registry The Book Rights Registry is an entity to be founded as part of a settlement of the lawsuit between the Authors Guild and Google over the Google Books scanning project. The Registry will be initially funded by $34.5 million from Google but it will b ...
*
Digital library A digital library, also called an online library, an internet library, a digital repository, or a digital collection is an online database of digital objects that can include text, still images, audio, video, digital documents, or other digital me ...
*
List of digital library projects This is a list of digital library projects. See also * Bibliographic database * List of academic databases and search engines * List of online databases * List of online encyclopedias * List of open-access journals * List of search engines Re ...
*
Universal library A universal library is a library with universal collections. This may be expressed in terms of it containing all existing information, useful information, all books, all works (regardless of format) or even all possible works. This ideal, althoug ...
* National electronic library


References


Further reading

* *


External links

* *
About Google Books
* * * * * {{Authority control Computer-related introductions in 2004 Full-text scholarly online databases
Books A book is a medium for recording information in the form of writing or images, typically composed of many pages (made of papyrus, parchment, vellum, or paper) bound together and protected by a cover. The technical term for this physical ar ...
Books A book is a medium for recording information in the form of writing or images, typically composed of many pages (made of papyrus, parchment, vellum, or paper) bound together and protected by a cover. The technical term for this physical ar ...
Scholarly search services